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STUDY OF THE PREDICTION OF MANPOWER CHANGE BEHAVIOR 



USING REGRESSION METHODS 



R. R. Read 

Naval Postgraduate School 
Monterey, California 



ABSTRACT 

It is shown that the use of regression methods in the 
forecasting of Separations (EAOS) , Eligibles (to reenlist) 
and Non-reenlistments jointly by length of service and pay 
grade are competitive with the currently used "alpha" 
method. The question of whether one of the two methods of 
forecasting is clearly superior could not be addressed with 
the currently available data. The report describes the data 
base, presents various general characteristics of the data, 
summarizes the computational results that lead to the 
recommended choices of input, and recommends follow-on work 
to clarify the issues. 



I. Introduction 



Previous work explored the use of regression techniques 
(especially ridge regression) in the forecasting of contract 
losses, gains, and attritions by a set of LOS (length of service) 
categories and (separately) by PG (pay grade) categories. For 
sake of immediate reference, the results are included as 
Appendix A of the present report. The reader is referred to 
its first four pages for the definition of terms whose use 
continues in the current work. 

The current follow-on efforts ignore the gain and 
attrition quantities in order to focus on three important 
components of contract losses: 

S = Separations (EAOS) 

Y = Eligibles 

V = Non-Reenlistments 

At the same time the forecasts are more refined in that they 
must be made jointly by 31 LOS cells and 7 PG cells. Pay 
grades El to E3 are lumped together in the first cell and 
pay grades E4 thru E9 form the remaining six cells. 

In addition the current study utilizes an increased 
and modified data base. Now there are eleven years of data 
(1966 to 1976 inclusive) and the definition of LOS has 
changed, the new data reflecting this change and some other 
changes whose nature is not explicitly known to the author. 

These changes make obsolete the specific results of the previous 
work . 
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The updating of the data base has also had an effect 
on the currently used method (which we call the alpha method) 
for forecasting the three object variables S, Y, and V. 

The results of this method are available only for 1976, making 
difficult the comparison of the regression forecasts with 
current method forecasts since no actuals (1977) are yet 
available. Hence a fair comparison of the two methods cannot 
be made at this time. 

A biased comparison is made instead. All eleven years 
data is used to develop the (1976) forecast coefficients for 
both methods, but are applied to the previous year's data (i.e 
1975 serving as input data) to forecast 1976. Thus the 1976 
data can serve as the actuals. The comparison is compromised 
because the 1976 data was used also to produce the forecast 
coefficients. Some reasons why this deficiency may favor the 
alpha method will be suggested further on in the report. 

The regression methods involve the use of p input 
variables p ranging over 2, 3, and 4 and the most favorable 
set of p variables is always selected. Two measures of 
comparisons are computed: 

MAE = mean absolute error 
RMSE = root mean square error 

where the errors are the differences between actuals and 
forecasts, and the means are computed over the 217 (31 x 7 ) 
cells. The results are summarized in Table 1.1. It is con- 
cluded that the regression method is competitive with the 
currently used alpha method. 
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TABLE 1.1 



COMPARISON OF SUMMARY MEASURES OF FORECAST ERROR 
FOR THE 'ALPHA METHOD' AND THE 
BEST MULTIPLE REGRESSION USING p VARIABLES 







alpha 


P = 2 


P = 3 


p = 4 




MAE 


123.5 


161.5 


146.4 


118.4 


Separations 




RMSE 


496.0 


651.7 


576.4 


436.7 




MAE 


104.6 


120.2 


88.3 


75.9 


Eligibles 




RMSE 


380.6 


469.4 


292.9 


232.8 




MAE 


50.6 


77.2 


54.9 


40.3 


Non-Reenlistments 




RMSE 


291.8 


338.0 


238.7 


199.7 



MAE = mean absolute error 
RMSE = root mean square error. 
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The report is organized as follows. A description of 



the new data base and the anomolies remaining in it is contained 
in Section II along with some general comments about what the 
author knows about the alpha method. Section III contains some 
comparisons over time of the macro behavior of the five 
important variables contained in the new data. Some interpre- 
tations and speculations are made. Section IV contains the 
refined details of the comparison of regression methods and the 
alpha method. Further interpretations are made. Conclusions 
and recommendations follow. 

As mentioned earlier, the report of the previous work 
appears in Appendix A. Eleven year means of the three object 
variables appear in Appendix B. Appendix C contains APL programs 
pertinent to this report. 
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II . Description of the Data and the 'Alpha* Method 



The new data contains eleven years' (1966 to 1976 in- 
clusive) data for the five variables 

V = non-reenlistments 

S = separations (EAOS) 

Y = eligibles 

X = retentions 

T = inventory (total) 

for each of the original 279 (= 31 x 9) LOS/PG cells. (The 
telescoping of pay grades E1-E3 is done later when forecasting.) 

All of the eleven years' data have been reworked to 
accommodate a new definition of LOS. The old definition was 
based upon pay entry base data which included pay credits for 
other federal service. The new definition refers to TAFMS 
(total active federal military service) . Although the exact 
meaning is unknown to the author fit is known that reserve duty 
does not count. (It appears that the data of actual entry is 
used by no one.) The new definition affects our previous LOS 
entries quite noticeably. 

Previously, the variable V (non-reenlistments) was 
a derived quantity being the difference of eligibles and 
retentions . Now it is obtained independently in some way and 
there is noise in the relationship V = Y-X. This and other 
data anomolies are described next. 
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Each of the five variables V, S, T, X, T are recorded 



in 3069 (= 11 x 31 x 9) cases. Separations exceed Eligibles 
(i.e. S ^ Y) in all of them, as is proper. However eligibles 
fail to exceed retentions (Y 2l X) in 97 (of 3069) cases. Of 
these, 44 failures are in the most recent two years (1975-76) 
and in PG, E8 and E9. PG E7 contains some concentration of these 
also, but the remainder appear to be scattered. The non-reenlist- 
ment variable fails to be non-negative (V ^ 0) in 107 cases. 
Again 44 of these failures are in the last two years and for 
E8 and E9. In fact these four cells (summed over LOS) have the 
exact same distribution as the previous analomy (Y / X) . 

Again E7 has a number of negative entries The differences 
between separations and retentions (contract losses) should be 
non-negative. This fails (S / X) in 76 cases. The noise is 
concentrated in PG E7, E8, E9 and years 1975-76 which accounts 
for 52 of the 76 cases. The relationship V = Y-X (i.e. 
non-reenlistments form the difference between eligibles and 
retentions) holds up in 2256 of the 3069 cases (74%) . The 
discrepancies cluster mostly in E4 to E7 and the first 20 LOS 
categories . 

Finally, the new data appear to have larger inventories 
in 1966-67. Specifically 591 x lo^ vs 586 x lO^ for 1966 and 
662 X 10^ vs 653 x IQ^ for 1967. 

The currently used method of forecasting the object 
variables is a "black box" as far as the user is concerned. 

It requires the production of "alpha matrices" whose elements 
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serve as multipliers to convert the input into the forecast. 
Three such 31 by 7 matrices are available to the author, 

^ ^ A 

Ug/ otg, and are used as follows: Let S, Y, V represent 

the output projections for the next full time period. Then 



S 






Y 



Vo 



V 






where T^ is the matrix of most recent inventories, Sq is the 
matrix of separations for the most recent period, and Yq is 
the matrix of eligibles for the most recent period. The 
indicated multiplications of matrices are elementwise. 

The alpha matrices are updated each year when the data 
become available. The technique does not appear to be well 
documented but is available in the form of computer programs. 

It is believed to be a version of "Tukey's Smoothing Medians" 
and Tukey's materials on exploratory data analysis may be 
useful in tracking it down. (See also: McNeil, Interactive 
Data Analysis . ) Comments concerning the application of it to 
the problem at hand are contained in the informal papers 
entitled 'Introduction to Smoothing and Projection of Naval 
Population Matrix Time Series,' and 'Rifselsm Overview.' 

These do not appear to be very useful to the analyst. 

The three 1976 alpha matrices used in this study appear 
in Tables 2.1 to 2.3. The values have been converted to 



percentages and rounded. 



TABLE 2.1 



ALPHA MATRIX MULTIPLIERS TO FORECAST 
SEPARATIONS FROM INVENTORY (includes 1976) 



PG 



LOS 



2 


5 


19 


29 


58 


3 


2 


12 


27 


74 


46 


45 


2 


23 


37 


28 


33 


43 


38 


1 


93 


67 


73 


93 


79 


29 


10 


79 


25 


25 


17 


22 


36 


30 


2 


41 


41 


29 


38 


31 


47 


99 


14 


1 8 


16 


23 


27 


21 


4 


18 


22 


22 


32 


32 


41 


5 


11 


22 


25 


32 


52 


32 


1 0 


23 


29 


32 


36 


44 


26 


18 


16 


21 


2 3 


3 0 


43 


12 


6 


36 


32 


28 


33 


4 4 


48 


11 


14 


18 


24 


3 4 


38 


37 


23 


25 


24 


27 


32 


29 


32 


45 


28 


22 


26 


31 


29 


32 


35 


30 


35 


31 


28 


29 


30 


33 


23 


16 


19 


23 


25 


28 


26 


65 


11 


15 


16 


17 


17 


17 


3 


7 


7 


10 


16 


19 


22 


1 


1 


7 


12 


19 


22 


21 


1 


26 


1 0 


16 


18 


19 


21 


1 


46 


2 5 


16 


18 


23 


19 


1 


2 


49 


15 


22 


28 


31 


1 


1 


74 


2 0 


21 


19 


21 


1 


1 


82 


42 


1 7 


17 


19 


1 


4 


29 


58 


22 


16 


21 


1 


1 


13 


27 


17 


27 


29 


5 


1 


3 


21 


15 


15 


12 


1 


1 


1 


8 


7 


3 


5 


1 


1 


74 


1 7 


5 


7 


9 


1 


2 


13 


27 


8 


1 0 


14 



8 



TABLE 2.2 



ALPHA MATRIX MULTIPLIERS TO FORECAST 
ELIGIBLES FROM SEPARATIONS (includes 1976) 



LOS 



38 


79 


85 


100 


100 


3 


99 


48 


9 8 


100 


100 


82 


14 


1 


37 


98 


99 


100 


95 


3 


93 


41 


98 


100 


98 


100 


14 


1 


53 


98 


99 


100 


100 


100 


2 


74 


9 7 


99 


100 


100 


100 


100 


78 


96 


99 


100 


9 5 


99 


56 


56 


9 7 


99 


100 


1 00 


98 


14 


68 


95 


99 


100 


100 


100 


14 


64 


96 


99 


100 


100 


95 


56 


50 


96 


99 


100 


100 


100 


96 


59 


9 3 


99 


100 


100 


100 


12 


51 


94 


99 


100 


100 


100 


56 


68 


86 


99 


100 


100 


100 


100 


76 


9 7 


99 


100 


100 


100 


100 


9 5 


97 


100 


100 


100 


100 


1 00 


43 


93 


100 


100 


100 


100 


100 


78 


100 


100 


1 00 


100 


100 


100 


16 


100 


9 8 


1 00 


100 


100 


1 00 


1 


5 


94 


100 


100 


10 0 


100 


1 


99 


100 


100 


100 


100 


100 


1 


9 5 


91 


100 


100 


100 


100 


1 


83 


95 


100 


100 


100 


100 


1 


45 


85 


100 


100 


100 


100 


1 


1 


95 


100 


100 


97 


1 00 


0 


83 


1 00 


100 


100 


100 


100 


0 


1 


17 


100 


100 


100 


100 


0 


0 


17 


100 


100 


100 


100 


0 


1 


1 


100 


100 


100 


100 


0 


0 


87 


99 


100 


99 


95 


0 


2 


96 


99 


97 


100 


100 
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TABLE 2.3 



ALPHA MATRIX MULTIPLIERS TO FORECAST 
NON-REENLISTMENTS FROM ELIGIBLES (includes 1976) 



PG 



LOS 



70 


43 


39 


57 


62 


3 


12 


89 


68 


14 


48 


5 


2 


1 


81 


61 


36 


21 


86 


4 


3 3 


81 


7 5 


60 


19 


54 


15 


2 


56 


5 2 


46 


40 


53 


99 


3 


30 


60 


68 


70 


52 


10 


26 


35 


37 


46 


46 


19 


5 


5 6 


18 


2 8 


32 


38 


4 


31 


5 


38 


2 0 


26 


22 


3 


5 


5 


42 


20 


23 


18 


7 


8 


4 


2 


20 


14 


14 


6 


2 


7 


26 


8 


1 5 


1 0 


7 


14 


5 


1 


12 


1 0 


6 


4 


1 


3 


16 


2 3 


5 


4 


2 


4 


5 


3 4 


7 


5 


4 


1 


1 


2 


2 


9 


4 


2 


1 


2 


1 


45 


1 


5 


1 


1 


1 


1 


1 


2 3 


2 


1 


1 


1 


1 


1 0 


15 


6 


2 


1 


1 


1 


1 


3 


21 


3 


2 


2 


2 


1 


31 


46 


5 


4 


1 


1 


1 


46 


39 


7 


4 


4 


1 


1 


7 0 


3 4 


6 


1 


1 


1 


1 


46 


1 


1 


2 


1 


1 


1 


1 


1 


16 


5 


1 


1 


1 


70 


2 


19 


7 


3 


1 


1 


1 


1 


2 


1 


1 


2 


1 


1 


2 


22 


5 


1 


1 


1 


1 


3 


4 


2 


1 


1 


1 


1 


1 


1 


18 


1 


2 


1 


1 


1 


2 


8 


1 


1 
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III. Plots Illustrating the Macro Behavior of the Data . 

The five variables were summed over LOS and PG, and 
their behavior has been plotted against time, against each 
other for selected pairs, and the first four against time as 
a fraction of Inventory. These plots, together with comments 
about them, follow. The totals appear first as Table 3.1. 



TABLE 3.11 





V 


S 


Y 


X 


T 


1966 


73319 


120368 


11. 7929 


43409 


59146 3 


1967 


99598 


1 539 64 


150426 


49604 


662056 


1968 


1 012 81 


136697 


133677 


31 34 8 


663779 


1969 


125236 


161051 


15 ROOP 


31 897 


673589 


1970 


138025 


191950 


1 80070 


41599 


684109 


1971 


87690 


155077 


1 30599 


42 573 


605 89 8 


1972 


74181 


134110 


120458 


4 5 8 6 5 


542298 


1973 


81670 


14 79 0 8 


136894 


543 83 


510669 


1974 


55365 


122347 


111 399 


55538 


490009 


1975 


45815 


11 8115 


102735 


55 625 


474735 


1976 


47471 


112983 


9 7 3 0 


45 63 9 


465748 
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FIGURE 3.1 



INVENTORY VS TIME 
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Total stocks of Navy enlisted personnel during the 
Vietnam War period and the decline after that period are 
illustrated in Figure 3.1. Total Separations (EAOS) are shown 
in Figure 3.2. The spike for 1970 surely represents an 'end 
of the war' idiosyncrasy. Separations as a fraction of stocks 
(Figure 3.3) shows two spikes 1970 and 1973. We do not know 
how much the latter one affects the alpha matrix coefficients 
generated to produce a forecast of 1977 EAOS as a multiple 
of stocks. 
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FIGURE 3.2 



SEPARATIONS VS TIME 
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FIGURE 3.3 
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The next four macro plots are scatter diagrams of 
separations with each of the other four variables. They show: 

1) a moderate correlation of separations with inventory, 

2) stronger correlations of separations with non-reenlistments 
and eligibles, 

3) virtually no correlation between separations and retentions. 
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FIGURE 3.4 



SEPARATIONS VS INVENTORY 
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FIGURE 3.5 

NON-REENLISTMENTS VS SEPARATIONS 
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FIGURE 3.6 



RETENTIONS VS SEPARATIONS 
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FIGURE 3.7 

ELIGIBLES VS SEPARATIONS 
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FIGURE 3.8 



ELIGIBLES AS A FRACTION OF SEPARATIONS VS TIME 
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Figure 3.8 has special interest since the 'alpha method* 
forecasts eligibles as a proportion of separations. Cyclic 
behavior begins about 1970 (post war) . The 1977 forecast is 
expected to continue the downtrend exhibited above in the more 
recent years. Indeed, based on this graph the forecast for 
1976 is expected to be rather good. Time series smoothing 
methods have a tendency to do well when the trend continues, 
but they are caught when the series either tops or bottoms out. 
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FIGURE 3.9 



ELIGIBLES VS TIME 
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FIGURE 3.10 

ELIGIBLES VS INVENTORY 
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FIGURE 3.11 



RETENTIONS VS ELIGIBLES 
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figure 3.12 

NON-REENLISTMENTS VS ELIGIBLES 
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FIGURE 3.13 



ELIGIBLES AS A FRACTION OF INVENTORY VS TIME 
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Figures 3.9 thru 3.13 focus on the macro behavior of 
eligibles. This variable is more directly related to policy 
than are the others. Figure 3.13 suggests that the relevant 
policy is not very stable. Again the spikes of 1970 and 
1973 are the most disruptive. Figure 3.11 suggests a poor 
but negative correlation of eligibles with retentions. 

Figures 3.7, 3.10 and 3.12 indicate important positive 
correlations of eligibles with separations, inventory and non- 
reenlistments. Figure 3.9 contains little new information 
in the light of the others. 
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FIGURE 3.14 



NON-REENLISTMENTS AS A FUNCTION OF ELIGIBLES VS TIME 
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The 'alpha method' forecasts non-reenlistments as a 
proportion of eligibles. This signal appears to have bottomed 
out in 1975 and the method is presumed to respond to this. 
Although we would expect the 1976 forecast (which is based on 
data thru 1975) to be poor because of the tendency to over swing, 
forecasts using data thru 1976 should not suffer as much. 
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FIGURE 3.15 



NON-REENLISTMENTS VS TIME 
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FIGURE 3.16 

NON-REENLISTMENTS AS A FRACTION OF INVENTORY VS TIME 
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FIGURE 3.17 



NON-REENLISTMENTS VS INVENTORY 
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FIGURE 3.18 

NON-REENLISTMENTS VS RETENTIONS 
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Figures 3.15 and 3.16 show macro behavior of non-reenlist- 
ments and non-reenlistment rate as a function of time. They are 
readily interpretable in the light of war and post-war years. 
Figures 3,5, 3.12, 3.17, and 3.18 are the scatter plots of 
non-reenlistments with each of the other four variables. They 
show positive correlations of non-reenlistments with inventory, 
separations, and eligibles, while the relationship with 
retentions is negative and appear noisy. Recall that the 
relationship V = Y-X breaks down in 26% of the cases. 



FIGURE 3.19 
RETENTIONS VS TIME 
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FIGURE 3.20 



RETENTIONS AS A FRACTION OF INVENTORY VS TIME 
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FIGURE 3.21 

RETENTIONS VS INVENTORY 



60000 



o o 



50000 



40000 



300001 

450000 



I I 

500000 



I I 

550000 



o o 



600000 650000 700000 



25 



For completeness, the macro behavior of retentions is 
included. Figures 3.19 and 3.20 show somewhat smooth cyclical 



structure. The scatter plots with the other 
appear in Figures 3.6, 3.11, 3.16 and 3.21. 
modestly negative correlations. 



four variables 
All four indicate 
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IV. Supporting Details of the Comparison 



The data bank for any of the object variables may be 
viewed as a time ordered set of eleven 31 by 7 matrices. As 
we move thru time, the numbers in these matrices will roll and 
twist as in a wave motion that reflects the changing size of 
the total enlisted force and how those changes affect the various 
cells. The set of inventory matrices form the base of the stocks 
of people. Although individual people are not tracked in this 
set, it is helpful to draw attention to the fact that an indi- 
vidual changes LOS cell each year and PG cell periodically. Thus 
in these recent years of drawdown (in size) one expects relatively 
higher exits from the lower LOS cells. Such motion is reflected 
also in the other four variables since inventory provides the 
base of stocks on which each of the others draw. 

A little more detail concerning the three objective 
variables can be obtained by studying Tables 4.1 thru 4.3 which 
compare their changes over the most recent two years for the 
first 18 LOS cells and all 7 PG cells. The macro behavior of 
EAOS as shown in Figure 3.2 suggests that recently a smooth 
decline has taken place. Table 4.1 shows that these changes 
have been rather drastic in the first two columns and the lower 
LOS cells. Since the 'alpha' method smooths the time series 
cell by cell, it is expected to perform well in those cells 
that sustain a trend and poorly in those that do not. 
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LOS 



LOS 



TABLE 4.1 

RECENT BEHAVIOR OF EAOS SEPARATIONS 1975 



El-3 


E4 


E5 


E6 


E7 


E8 


E9 


2019 


7 00 


18 


5 


10 


0 


3 


7 59 0 


4940 


323 


3 0 


2 


1 


0 


779 6 


7126 


2205 


25 


3 


0 


3 


5627 


17820 


12906 


115 


7 


1 


0 


551 


1666 


2038 


135 


8 


1 


0 


466 


1270 


3712 


722 


10 


1 


2 


135 


209 


1216 


626 


12 


1 


2 


67 


160 


1053 


1049 


18 


2 


0 


49 


189 


1094 


150 5 


68 


2 


0 


37 


259 


1193 


1580 


125 


1 


1 


12 


98 


600 


1379 


249 


2 


0 


9 


70 


54 3 


15 5 5 


408 


17 


2 


3 


3 8 


447 


1921 


624 


36 


1 


8 


72 


465 


1833 


717 


74 


5 


4 


42 


314 


1694 


9 08 


167 


20 


5 


48 


288 


150 5 


1092 


235 


42 


0 


13 


132 


1089 


1147 


322 


69 


3 


10 


91 


672 


799 


242 


62 



SEPARATIONS 1976 



El-3 


E4 


E5 


E6 


E7 


E8 


E9 


624 


70 


1 8 


7 


6 


0 


0 


6017 


2942 


19 0 


23 


5 


0 


2 


14352 


12 09 2 


1140 


21 


8 


0 


2 


5906 


15 9 4 0 


15649 


54 


2 


0 


1 


469 


1471 


1 894 


103 


8 


1 


1 


221 


1280 


3 3 79 


823 


7 


1 


1 


78 


210 


1118 


620 


14 


1 


0 


3 7 


175 


1172 


120 5 


20 


4 


0 


19 


129 


75 0 


1168 


76 


6 


0 


35 


192 


1 096 


1671 


129 


2 


0 


11 


89 


6 31 


1093 


1 80 


1 


1 


7 


74 


488 


1132 


264 


11 


0 


2 


20 


252 


1079 


385 


17 


1 


3 


25 


310 


1219 


471 


39 


5 


3 


26 


3 37 


lino 


615 


55 


5 


3 


27 


233 


1010 


638 


94 


19 


2 


11 


115 


6 71 


613 


132 


3 0 


2 


4 


60 


484 


490 


124 - 


29 
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TABLE 4 . 2 



LOS 



LOS 



RECENT BEHAVIOR OF ELIGIBLES 



Eligibles 1975 



El-3 


E4 


E5 


E6 


E7 


E8 


E9 


894 


6R5 


1 6 


5 


10 


0 


3 


3386 


4814 


32 3 


30 


1 


1 


0 


2879 


69 41 


217 5 


25 


3 


0 


3 


2265 


17372 


12 7 8 6 


112 


7 


1 


0 


284 


1623 


2 003 


1 34 


8 


1 


0 


370 


1221 


36 71 


716 


1 0 


1 


2 


106 


202 


1203 


622 


11 


1 


2 


46 


157 


1039 


1 047 


18 


1 


0 


35 


178 


10 84 


1497 


68 


2 


0 


25 


248 


1172 


15 7 7 


125 


0 


1 


5 


96 


59 4 


1 3 7 1 


249 


2 


0 


4 


67 


538 


1549 


408 


17 


1 


1 


3 7 


438 


1915 


624 


36 


1 


7 


72 


461 


18 2 8 


712 


74 


5 


3 


40 


311 


16 88 


907 


167 


20 


5 


4 5 


288 


1502 


1 091 


235 


42 


0 


12 


131 


1085 


1147 


322 


68 


3 


9 


91 


672 


799 


242 


62 



Eligibles 1976 



El-3 


E4 


E5 


E6 


E7 


E8 


E9 


217 


51 


1 5 


7 


6 


0 


0 


2739 


287 6 


19 0 


23 


4 


0 


2 


5124 


11774 


112 3 


21 


7 


0 


2 


2 39 0 


15434 


15516 


5 3 


2 


0 


1 


264 


1425 


185 4 


103 


8 


1 


1 


131 


12 33 


33 3 8 


81 7 


7 


1 


1 


57 


199 


1107 


618 


14 


1 


0 


18 


167 


1157 


1 197 


20 


4 


0 


11 


126 


739 


116 3 


76 


6 


0 


21 


182 


1078 


16 60 


129 


2 


0 


7 


84 


620 


1090 


180 


1 


1 


4 


68 


4 81 


1121 


263 


11 


0 


1 


18 


252 


107 6 


383 


17 


0 


2 


20 


3 05 


1216 


469 


3 9 


5 


3 


25 


332 


1177 


611 


55 


5 


3 


2 7 


2 31 


1007 


638 


94 


19 


2 


10 


1 1 5 


669 


612 


1 32 


30 


1 


4 


60 


482 


489 


124 


29 



29 



The macro behavior of eligibles as shown in Figure 3.9 
also declines smoothly but Table 4.2 shows some sharp drops for 
LOS = 1 and some sharp increases for LOS =3. In the former case 
the advantage is to the alpha method and in the latter it is not. 
Regression methods must look elsewhere to pick up these signals. 

According to Figure 3.15 Non-reenlistments level off in 
the macro sense. Table 4.3 shows some rather drastic movements 
for the lower LOS and PG cells. Again LOS = 3 shows some 
sharp increases. 

The forecasting of the object variables using regression 
methods was performed repeatedly to meet several goals: First, 

the best set of p variables had to be identified. Second, 
the influence of the ridge constant (see Appendix A) needed 
some accounting. Third, the stability of the forecast where 
1976 data was not included in the forecast required examination. 

Table 4.4 contains a listing of the best set of p 
variables. The subscripts indicate time and the object variable 
appearing in the set with subscript t-1 indicates an auto- 
regressive contribution with a lag of one year. The correspond- 
MAE and RMSE values appear in Table 1.1. 

The resulting forecasting of separations does not depend 
very much on whether or not 1976 data are included in the 
forecast coefficients (as measured by the MAE and RMSE values) , 
but the inclusion does make a noticeable difference in fore- 
casting eligibles and non-reenlistments. This result was not 
anticipated from study of the macro plots. It may be, in part, 
a consequence of ill-conditioning (see Appendix A) . 
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TABLE 4.3 



LOS 



LOS 



RECENT BEHAVIOR OF NON-REENLISTMENTS 



Non-Reenlistments 1975 



El-3 


E4 


E5 


E6 


E7 


E8 


E9 


772 


550 


6 


2 


6 


0 


3 


29 76 


3125 


4 8 


7 


2 


0 


0 


2401 


4103 


764 


7 


3 


0 


1 


1 872 


13070 


6869 


IP 


4 


1 


0 


171 


877 


863 


51 


2 


1 


0 


91 


73 6 


2 3 52 


462 


5 


0 


1 


30 


74 


536 


261 


2 


0 


2 


12 


5 3 


277 


3 05 


~2 


“1 


0 


13 


3 7 


279 


302 


3 


0 


"l 


9 


51 


261 


256 


11 


“l 


0 


0 


12 


7 3 


156 


13 


”1 


0 


1 


6 


7 5 


138 


13 


1 


0 


0 


9 


20 


98 


12 


“6 


0 


1 


6 


24 


60 


"2 


4 


"2 


3 


2 


16 


22 


2 


“2 


1 


0 


1 


8 


23 


1 


1 


2 


0 


0 


6 


6 


”10 


5 


2 


“l 


0 


2 


1 


19 


"4 


0 



Non-Reenlistments 1976 



El-3 


E4 


E5 


E6 


E7 


E8 


E9 


133 


14 


6 


4 


5 


0 


0 


2359 


1857 


26 


10 


1 


“l 


2 


4137 


815 3 


3 79 


2 


6 


0 


1 


1862 


11099 


896 7 


18 


1 


0 


0 


1 52 


6 50 


764 


47 


4 


1 


1 


52 


667 


2 2 6 5 


553 


4 


0 


0 


25 


72 


5 08 


284 


3 


1 


0 


1 


41 


3 64 


3 86 


3 


2 


0 


4 


20 


172 


242 


0 


2 


0 


10 


31 


221 


287 


6 


0 


"l 


2 


19 


78 


140 


13 


1 


"l 


1 


2 


49 


94 


17 


2 


0 


0 


2 


23 


47 


12 


"1 


"1 


1 


7 


8 


26 


2 


"1 


1 


1 


3 


5 


29 


0 


3 


2 


1 


2 


6 


1 


"6 


0 


"3 


1 


0 


1 


3 


"9 


"l 


“l 


0 


1 


0 


1 


6 


2 


1 
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TABLE 4.4 



VARIABLE SELECTION FOR REGRESSION FORECASTS 



Object Variable 



P = 2 



P = 3 



p = 4 



Separation S^ 
Eligibles 

Non-Reenlistments V 




The use of a ridge constant in multiple regression 



serves to stablilize the regression coefficients when the re- 
gression variables are highly correlated (i.e. the problem is 
ill-conditioned) . The picture is cloudy, but generally its 
use is noticeable in forecasting separations and eligibles 
but not so much in forecasting non-reenlistments. A high 
level of ill-conditioning is anticipated whenever V, Y, and X 
are used in concert because of the logical relationship V = Y-X. 
Also a non-zero ridge constant may be appropriate when p = 2 
for eligibles and non-reenlistments. The value RC = .025 
in the space of standardized regression variables, seems to 
be a reasonable choice for the ridge constant, but overall, 
the ill— conditioning and stability merits further study. 



numerical comparison of 'alpha' forecasts with regression 
forecasts for p = 4 in the case of separations, and for 
p = 3 for the other two variables. In the latter cases the 



The remainder of this section is devoted to the 
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contribution of the fourth variable is suspect because of the 
relationship V = Y-X and the improvement indicated in Table 1.1 
may be unstable. 

The computation of multiple regressions individually 
for each of the 217 (31 x 7) cells is too time consuming and 
some grouping was necessary. The grouping chosen was based on 
the following partitioning of the eleven year averages for each 
of the object variables. Cells were treated individually as 
long as the eleven year averages were at least 1000 (1900 in 
the case of eligibles) . Then cells were grouped together in 
decrements of 100 down to an eleven year average of 100. From 
then on the groupings were in decrements of 10 with some arbitrary 
adjustments when zero was approached. 

Table 4.5 compares the (rounded) 'alpha' forecasts and 
regression forecasts (on T^ , separations. 

The ridge constant was .025 and all data were used in develop- 
ing the regression coefficients. The (rounded) errors of the 
two forecasts are compared in Table 4.6. 

Let us consider Table 4.6 in the light of the recent 
changes in separation shown in Table 4.1. For LOS = 1 
separations experienced a severe drop in 1976 in the first two 
columns. The "alpha" method piclced this up amazingly well. 

Of course this success is attributed to the use of 1976 data in 
the development of the coefficients. The same effect is 
exhibited in LOS = 2 for E4 . The first two entries for 
LOS = 3 in Table 4.1 show a sharp increase. This is a trend 
reversal that the alpha method does not respond to so rapidly 
even under these advantageous conditions. It makes its worst 
showing in these two cells. 
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ERRORS OF FORECAST: SEPARATIONS 
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Tables 4.7 and 4.8 compare the forecasts and errors of 
forecasts of eligibles for the 'alpha' and regression (on 
T^, methods. Again when we consider the major 

changes in 1976 by viewing Table 4.2 we see a sharp drop in 
coliimns 1 and 2 for LOS = 1 , a substantial drop in these columns 
for LOS = 2 and an increase for LOS = 3. Again the alpha 
method picks this up extremely well for LOS = 1, adequately well 
for LOS = 2, and responds poorly to the turnaround for LOS = 3. 
The regression method does surprisingly well for LOS = 1 and 
column 1. 

This same phenomenon continues when comparing the fore- 
casts of non-reenlistments. Table 4.3 shows a sharp drop in 
LOS = 1 that the alpha method picks up well, a drop in cell (2,2) 
is not so well forecast. The increases for LOS = 3 and cell 
(4,3) are again handled much better by the regression methods. 



36 



FORECASTS OF 1976 ELIGIBLES 



^ 0 (N^<rHCMCMCMCs[<rHT-i^ 



itLOLnco^aDzfCMLnTHcr5CMC7:cocr5zfo?coo 

THCOLnLOCOCT>CDLn<£)LOCOCSlCOrOTHTH^ 



^-rH^CNCSIt-ICMzJ-rOCOJ'^ 



CTvCoOzj-Lor^rocNLoaDUDcx. th 
CMCC^CO rtTHZfCOltl^tDtDOO 
^ tH CM CN CSi C\ tH 



CO ^ 00 cr> cr> 

It CO CM 



w 

4 J 

w 

rtJ 

u 

Q) 

u 

o 



CO 



cu 



c 

o 



CO LT CM 



o 



ooco^cr>cDt^Locotccx:cMcocr:ooo-rHC^ 

CM^ccLOOOLn^Oit<^c^THtccocr>itoco 

^CMCOCOCC(T;OOt^cDcCCN!^tH^ 



It 

It It 



m c^>! 

<N tH 



cc 



CD 




o 


CC 


o; 


o 


CD 


CD 


in 




00 


07 


CM 




in 


CO 




T-\ 


O 




T — 1 


O 


It 


cx. 


CD 


CD in 


It 


CM 


CM 






CM 


CO 


ct. 


CM 


cr- 


C^,‘ 


It 


CT> 


ID 


in 


It 


m 


r^ 


CO 


CD 


CM 


It 


tH 


It 


CM 


in 


CO 


tH 






















tH 


in 


CO 


CM 


O 


in 


CM 


CO 




in 


CD 


It 


o 


CD 


It 


CO 














































T-S 


tH 


tH 
































It 


cc 


CT« 


If 


CC 


o 


t — 5 


cr- 


It 


in 


CD 


tH 


It 


in 


It 


O'. 


CM 




CO 


CD 


CM 




in 


It 


CM 


CM T-. 


tH 


o 


O 


o 


in 


o 


o 


tH 


in 


cr 


CO 


CD 


o 


cc 


It 


CD 


CD 






CO 


It 


r-i 


CD 


CO 
























It 






CO 


cr 


It 


CM 


O'. 


o 


in 


It 


CO 


CO 


CO 


CM 




T-i 






























r—l 


It 




CM 


tH 

















































cn 

w 

<u 

M 

Q) 



in 


CD 


CM 


CM 


It 




CD 


CM 


in 


o 


CJ7 


00 




in 


cr 


O' 


00 


O 


O'. 


CM 


in 


cr. 






CM 




CM 


CM 


tH 


CM 




in 


T— 1 


in 




















tH 














CO 


00 


CM 


00 


cr. 


CM 


cr 


O 


in 


r- 


It 


o 


r^ 


07 




cr 


cr 


c-- 


CO 


CM 


CM 


O'. 


CM 


CO 


CO 


CM 












CO 


in 


tH 















<j^CMLC;c--(T.Lr)Or^ 
cc CO CO CO CO CM 



ODUOCOCNCMTH'rH^OOOOO 



ooooooooo 



o 



CM 



CO 



tH ^ tH tH tH ^ 



r>cooo^cDuoco^ 

th CO m It cc cT> 



o CM ^ 
o ^ ^ It 



cr o cr CM LO 
ID CM tH tH 



t-H^tH^CMCMCMCOCOCOCOC 



co<rHcocr>croccooocor^cMcocriCMcoLocx:oo 

C 00 DC 0 OcCCCit<i 3 OOOlt^CML 0 -rH 

^CJCMtHCMCMtHtHtH 



CO It LO cc o CM 



O-rHCMCT>L0O^Cr. OCOlOCT. CO 
CM 03 ^:tOcX 30 cCr^OLOLOTH 
^CMituotor-(T>occr-c^ 



CO zt* CO CO CO It 
It in «co cc CM It It 

CM tH 



th ^ in LO cc 



CM 



-p 

CO 

fO 

u 

0 ) 

u 

o 

u-i 

ra 

x: 

Qa 

rH 

c 



r--^t^inccor^itOitititincDr^itOcMCocM^critittncc 
CMCMCtDcr. COCCCMCOOcT>ltCM’rHTHCM 
coina'/C^initoccitco 

T— I tH ^ ^ 



CO CD C 75 C^ 
TH-rHCtCOt^CMCM^t^ 
tH CD CD CJ> CO It 



Cl 

a, 



CO 


O' 


It 


rH 


It 


CM 


It 


CO 


It 


O 


in 


rH 


r- 


rH 


CD 


CO 


ir-i 


00 


cr 


in 


r^ 


in 


CD 


CO 


CO 


CO 


o 


O' 


CO 


m 


C 


oc 




CM 


cc 


rH 


cc 


in 


rH 


O 


o 


CM 


CD 


It 


It 


It 


CO 


CM 








CO 


rH 


CO 


rH 


rH 


rH 


rH 




















rH 


























00 


It 






CD 


r- 


CM 


It 


CM 


It 


CO 


CM 


CM 


CM 


oc 


r^ 


C 


It 


c 


00 


cr 


ID 


O 


00 


00 


in 


cr 


CD 


CO 


CD 


CO 


It 


rH 


in 


O'. 


CD 


in 


CO 


CM 


rH 


r“! 


CM 
















It 


r- 


CD 


rH 


rH 




























r-i’ 


























It 


CO 




in 


in 


CO 


CD 


00 


CO 


in 


r- 


CD 


CO 


CD 


It 


It 


CO 


CM 


o 


in 


CO 


It 


O 


CO 


CM 


CM 














CM 


It 


in 


CM 


CM 


CO 


rH 





















CD CD Zf 



CO o CM in 

T— I tH 



CO th CO c>; 



CO 00 tH CC CM tH tH tH r—{ i r—'. ^ t— I tH 



It tH ^ tH ^ ^ 



CO CO CM 



CO 

o 



37 



ERRORS OF FORECASTING 1976 ELIGIBLES 



^CNOOOrHCNICNCNlTHO 

I 



THTHCSCNlJd-CDrHCMCOxH 



LO LO r-. 00 tc CM 

tH tH I 



+J 

w 

fO 

o 

0) 

U 

O 



^T^^c^lv^OTHOcoTHco(r)c\•o^lOO^o^o^I:ln^-^ozl-^-.r^cT;oo^-J3-u^ 

I T^z^cocnT^c^Ocnc^^c\’r^r:r^^o,H^ 



THTHCNiocx:oa?c'r^c\c7)cotciDcN<X)tDrocr!tcomcnmtDoc^LOc\cr-^ 

•• ^ ^ c\ CN’CNIZfCMZfCDCLOCOCN^rHr^Zfm C\OJ 

I -iH c\’ -d- CN tH ^ I 



rn 



Cu 



c 

o 

•H 

w 

to 

Q) 

U 

tr- 

0) 






^d-cDircni-^cocrocrcococvjC^cotcoccr. mr^Czftcocr:rnOc\-^^cM 

I •^^csiozftDcDvccNt^tcinmLCinTHzJ'cc^THTH i'l 

CNCNI I I 

I 

a;OC^(N^CCCJ-(N•l^>t^J■Oc^■OCCOCr-|^J-<^lt^CMOOr^r^r-^r-^Cr^C 

CO oc o <r. -d- CN th tc cm rH ^ c\* lc r: I i 

CMC\’Z^Z^CC^0^TH||^| 

tH I I 
I 

^c\'c\'CO,Htct^ir5cncctnzft^r-^oocrr:zfmoo,HO^cocooc 

^THO^rcroccr^iDcc ^ 

in tH ^ CO I 
^ I I 

tCcnocc\’inTHCs:cNiztcDcsirorozfCNi^rHOCMCNOOoooccocoo 

CN^TjtO^THtDja'inCNI I 

tH tH ^ tH tH 

" ' CO 

o 



^^OoOcNlTHTHTHrHOTHxHCNTHCD^r^rOTH^THZflDCOCOinrO^mrO 
I THTHCNlTH^rO|CM<r|CNirC 



THrHTHtHTHxHTHCOtHCN;rHtDCMOCLOt*-lt<I3rCOCOinCC^mOC^ZfrOjt 
I I I THCMr^THcot£:oc^jTHininTH| cs'cn 



-p 

CO 

fO 

u 

0) 

u 

o 

Pm 

rO 

s: 

a. 

f— I 

< 



o 

a 



corororoorocNomr^cncsir^CMCccsirH<i)Lnrot^cnr^t^ococin^LOOro 

I I I ^rHCCjj-c^roLnrocDtDt^o^zfrcztTHl cn^ 

I I tHtHtHtHCOCOtHCNtHI 



cozi-ro<l50Crooc\'ja'Omoc^t^t^LncoTHCOrozf^m 
I I I zl'COzj' t^mocrjOcN.if^c^^zi-zJ'TH 
^ CNT-^rHC\'^OCinztCOzfTHTH 

I I I 

CN<TTHi^Oitt^^i^c\'inOLni£:i£:cs:<rtDr^zfcninr^TH 
I cr. IT c\ T-s in cs: O', m ^ tH oc ^ CN irj tH CN r: I 
ro c\' tH Csi tH I tH ^ I 
CNI I 

I 

r^ocr^c^z^cor-‘^oc^io^^^Ja■CNcr:0^ccc^D^^oooc 
intDlTint^CO rHinC^ l r^Zf^HCNi 
iD CC C\' tH t— ! 
tH cn rH 
I 

r-Zft^mTHCNCJ)OcNi:a'OcNCNZfrHTHTHCOTHTHTHTHTHTH 
^CO^rOCNI^ZfCNIrH I 



in tc jt CNI CO CC in 



CC CN ^ X-! CM CN 



tH O ^ 



CD CD 
T-\ I 
I 



c\ 



CO 

O 

PI 



38 



FORECASTS OF NON-REENLISTMENTS 1976 






^tH^CM^CMCM^OCSICSItHOtHOCO 



CO^OJCN^COlOCOCNOJin^lOOJCO^COCOJ^OOCvjOcsiCSlTH^^THOCM 



o 

x: 

■p 

0 ) 






CD^CDCCC^OCtcmr^tCOCj-CNCOcrCNCTlCC 



C\C0CT)C0(j5C7>LO2fLnC0C\l^ 



cn 



II 

04 



crcoc^^tc^oc^^c ^r^cMt/iccrfr^cDc^ 
▼~i^o>:cc(Dcor^oc^CMOtor^ir)cs:THTH 

O OCCCriCSICNTHTH 

CM 



C^rCCCCCCVJCNICNlOCCC 



c: 

o 

•H 

w 

w 

a 

p 

CP 

<D 

(t; 



cr!THcrit^r-*r^a:ocr^THiDir)r^r^oc^t£)^L'^, 

rrCDtCtrcC^2ttC0CTHC0C<NCMC\,H^ 

cccctrr--;fcNc\T-i 

OC <N 



C\C\’^CNC0C0C00 



^ ^<^ococcr5 02tLnc\oo^-<T:iniricv4CN^T-fTHCoooooooco 
^incr02fir:cocr:^c\‘^ 

CN c\’ LO ^ in ,-1 
^ ^ OC ^ 



cocPT-^LCjTHOocrorocnromcsiCNCNiTH'rHOOOooooocooooo 

OOCOCOtDtDcnOOCNI'fHTH 
c-* ^ OC CN 
CO Jt tH 

CO 

o 

h:| 






<»H T— 1 tH t— I CNI tH tH CNI tH tH t— I CN tH CO ^ CN tH ^ tH CN tH JUJ" tH tH ^ CN tH I ^ t— I tH 



CNiHCOCOmtDOOCNCNOCittCCPr^JtltCOCNCOTHCOtD^^CNCO^CNtHCNTH 

tH cn t— I 



T3 

O 

XI 

4J COC2fCPrOLOCNCOCDjtOCtC'«HCCTHCNZfrfLnCDLOCOCN'«HCOLrj'^CNTHiHT-! 

QJ T~i THLOCDOCCPC^^tDCOCP^^'«-l 

S O zfOiOJCvJCN^TH 

Hi 

d 

XI 

CM 

rH tCC'-r^L 0 C 0 r^O 2 f 0 COCNCT)Cr. Cr 5 ^CPtDC^J-d“C^^^LO^THTH^T-S^THTH 

<tj COLnCN^CJiCOCNCCr^COtDCOTHTH 

iCr^OCCOLOCOCN CN 
r- CN 



CCcriCNUO'T-lCOrfOLnOCCPLOLOJtOCLO^CNCNTHCNi-lT-lTHt-l^rHTH^I^TH 
^rLTLnCOTHTHC^LCiCOJtTH tH 

c CO OC CD 

CO rf CN 



THOCs|OCOC^COr**“O^^THCN^CNCNTHT— I t— It— It— I^T— 1^ 

tOOOOlOOcO tH 

T— I O 00 OC T— I T— I 
CO CN tH 



CO 

o 

XI 



39 



ERRORS OF FORECASTS OF NON-REENLISTMENTS 1976 



O CNJ -iH C 

I I 



OCNCsl'cH'iHCNCN'rHCNOcOzfCNCO^CNTH'rHTH^CO^O^^CO 

I 



O <tH ^ CN 



^^CNC0^CNC0C0CNCDmrCCNinitt£)mTHCC0^<iH'rH'fH-rHCCs: 



4J 

to 

05 

u 

d> 

u 

o 

Ua 



-iH CO CO 00 ^ CO 



cx5r^oincoOTHCJ>cccoo^coc^rouoooCT)OO^uoin<iH'rH^ 

tHtHtH I iHrHCslT-lTHTH 



ro 



II 

o 

Qa Ph 



c 

o 



•H 

to 

to 

(U 

u 

tp 



in 00 CO 



O) CM 
-rH in 
tH 



ino'^c^incoincrc^inoc^ 

'iHCOrt'rHT-i'iHltCS’CNCslTH 

I I I 



^THinCCOCOCO'fHCN'rHC^C 

r-i I I 



COmCCOCCNJCP-:tlC^COC^CNC'CNIC\*Cir)rOCN'fHTH CNCOCCCCO 
itcCr-lOCOCCCOCrHl CNICO THCNtHTH I I 1 *^ 

CO to CO CN' "IH rH 

I I I 



Cltr-CPCNCOOOCOin<iH'iHCCinCN:CNCOC\'rH^'«-;C,HTHiHO occoc 

OcnCOCCOCN^TH-rH III 1 ^ 

CO ^ cf in ^ 

-iH CN II 



OO^r^CnCOCOzfcncn-rH^CNlTHTHOOOOOOCCOOOOOOOO 

inC0C0C7)OC0-fHCN 

lO tH to CO ^ 

T— I I 



o 

l-q 



O^OOOTHOC^CNCNTHCNOCOZfCNCN<rHCMO<rHCNICOCNTHTH^O<iHrH 

I 



CCNT-iTHCxHO^^^CNOCN^^CNCNCOCNltCOCNvHCNCOTH^^THTHTH 

I I 



4J 

to 

fd 

u 

0) 

J-l 

o o 

Ua Ph 

rd 

Qa 

iH 

< 



COCcO’r^OTH^Z^CNCN^CX:tOlncJ-OxHr^^lncNC^lTH^CCNC\<r^-r^OO 

III T— I X— I I I 



CNxHC^OinoOCOZj-ItJ-OCO>.:tCNxHCNr-J-xHinc\CxHxHCOCOxHOxHCxH 
II 001 <PCOCOCNltrtCN!xHxH 



xHCs:r-Cs: 0 '. CNxHxHOcTCOO'. in^CCCNinCs:cOCx-iCOC\xHxHxHxHxHxHCxH 

I x-iC^ltt^COCNzfO^^ 

CN CM x-{ I 



I 



CNC\xHinxHincNaDintC,HCOCNC^OcNTHxHCNxHxHOO^OOCOOOC 
COCOCOlCzJ- t-ixHl I 

Cs: J- CM xH xH 
x-i CO XH 

I 



c^otncoinoo^ooinc^^^oooc 

CM It CO lO It xH I 

to CO I 



t/D 

o 



xHxHxHxHOthOOOOOOOO 



40 



V Conclusions and Recommendations 



A fair comparison of the two methods could not be made 
on the one hand because 1977 actuals are not yet available and, 
on the other hand, because the alpha matrices for 1975 were not 
available. The alpha method appears to be a time series type 
method that possesses the typical lag characteristics at peaks 
and troughs. The regression methods appear to be competitive 
and have the feature of being more stable. 

Some examination of the residuals (over time) of the 
regression methods took place. Although some firm lag correla- 
tions are present, they do not appear to be significant, based 
on the computation of Durbin-Watson statistics. Thus a hybrid 
system using both time series and regression methods is not 
expected to produce highly substantial improvements. 

It is recommended that a fair comparison be made and 
that an appropriate measure of the cost of forecast error be 
developed for it. The real extent of time series "overswing” 
would become apparent and its effect would be examined in 
more realistic terms. 

It is desirable to develop the regression approach 
further. The question of the correct set of ridge constants 
needs be faced more carefully as well as the question of the 
grouping of cells. The residuals should be examined more 
carefully with one eye focused on the time dependence behavior 
and a second eye looking for suitable transformations to 
remove skewness and stabilize the variance. 
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APPENDIX A 



FORECASTING MANPOWER CHANGES USING REGRESSION METHODS 



Data and Notation ; 

Ten years data are available, specifically 1966-1976 (fiscal 
year begins on 1 July of previous year) . The first subscript, t, 
will index the years (t = 1,...,10) with t = 1 referring to 
fiscal 1966 (1 July 1965 to 30 June 1966), etc. The second and 
third subscripts refer to length of service (LOS) and pay grade (PG) , 
categories respectively, using the indices i and j. Basically 
then the 31 LOS categories are interpreted as follows : An enlisted 

man with LOS = i is one who on 1 July of that period had completed 
at least i - 1 years of service but less than i, for i = 1,...,30. 
If i = 31 then the number of years completed service is "at least 
30." There are nine PG categories referring to the pay grades 
El , . . . , E9 . 

The resulting data arrays were too cumbersome for our explo- 
ratory work and some arbitrary grouping was imposed. Specifically, 

LOS categories 6 through 17 were grouped together as were categories 
18 through 31. Also PG categories El, E2, E3 were aggregated. 

Thus the present study treats seven LOS groups and seven PG groups. 
Notice that the detail lost in aggregating the high LOS cells is 
partially recovered because the corresponding (high) PG cells are 
intact. Similarly, the intactness of the low LOS cells retains 
some of the information lost by aggregating the low PG cells. 

The following notation was adopted for the given data . 

G gains 
A attritions 
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R retirements 

X retentions (re-enlistments) 

Y eligibles 

S separations (EAOS) (S > Y > X) 

T inventory (total Navy enlisted) 

Some additional derived quantities are useful. 

U = S - Y ineligibles 

V = Y - X non-reenlistments 

W=U+V=S-X contract losses 

Some explanation of these quantities is helpful. Gains 
refer to the number of people from outside the Navy that enter the 

enlisted Navy during the fiscal year in question, in each LOS, PG 
category used. New recruits are not included since they need not 
be forecast. Promotions represent internal movement and are also 
not reflected in the gains used here. Changes which are included 
in gains are those persons who reenter the service after having left, 
under programs called continuous service or broken service reenlist- 
ment contracts. A category called miscellaneous gains is also 
included here, representing gains by various methods, not including 
the recruits. 

Attritions refer mainly to people who are dismissed prior 
to the expiration of their contract. It also includes deaths, 
disability discharges, etc. Retirements begin in LOS category 
18. The only means of leaving the Navy aside from attrition and 
retirement is by failure to reenlist at the expiration of the 
contract, i.e., contract loss. All personnel are separated at the 
end of their contract. Not all separated personnel are declared 
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eligible for reenlistment and not all eligibles exercise their 
option to reenlist. Hence the inequality S > Y > X. The derived 
differences (S - Y and Y - X) are called ineligibles and non 
reenlistments, respectively. The sum of these two are the contract 
losses. 

To all the variables may be affixed the subscripts t, i, j 
which refer to periods and categories already described. All of 
the variables (except T) are interval functions and refer to the 
net result of a time period (fiscal year) . The variable T 
referring to the total size of the Navy is a point or "snapshot" 
variable and refers to the number of personnel on board on the 
first day of the designated period (left end point or, more specifi- 
cally, 1 July of the fiscal year) . 

Scope of Current Study 

Regression methods, specifically ridge regression, are applied 
to the forecasting of contract losses, gains, and attrition by total 
numbers in each LOS group and in each PG group. All data sets of 
variables are three dimensional arrays. Since the present work is 
concerned with exploring the usefulness of a methodology, the dimen- 
sions of the data sets were reduced to two in order to obtain 
simplicity and uniformity. This was done in two different ways 
since LOS groups and PG groups have separate interest. Thus when 
studying the predictability of the LOS groups, all data arrays were 
summed over j = !,...,? yielding time by LOS group matrices. 
Similarly when studying the predictability of the PG groups, all 
data arrays were summed over i = 1,...,7 yielding time by PG group 
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matrices. No combining or mixing of the two kinds of groups took 
place in this study. Thus we attempted to predict the various 
changes by LOS and by PG marginally, but not jointly. 

The choice of regression variables was made on hueristic 
grounds. The volume of data is quite limited (ten time values) 
and this limits one to the use of only four or five regression 
variables so that at least a few degrees of freedom remain for 
estimating the mean square error. Also any autoregressive feature 
was limited to the single most recent time period (or time point, 
in the case of the variable T) . Thus the ten time periods still 
yield nine full sets of observations. The degrees of freedom for 
estimating error are given by n - p - 1 where n = 9 and p = the 
number of regression variables. The choice of variables (same for 
each of the two kinds of groups, LOS and PG) appear below. 

Contract Losses regressed on Ineligibles, Reenlistments, 
Separations, Total Inventory. 

Gains regressed on Gains, Attritions, Ineligibles, Separations, 
Total Inventory. 

Attritions regressed on Attritions, Ineligibles, Reenlistments, 
Total Inventory. 

Further exploration could yield a better set of regression 
variables. The results so far are encouraging as will be seen. 

Methodology 

The development of ridge regression in the last five years 
(see Ref. 3, 4, 7) is proving to be an important step in treating 
the anomalies of regression problems. It's use is especially 
attractive where the correlation matrix of the regression variable 
is highly non orthogonal — a condition that is met liberally in the 
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present problem. The key to its successful application is in the 
selection of the ridge constant k. Current thinking on this 
question recognizes several competing forces and suggests the selec- 
tion of a range of values for k in which all the forces are rather 
stable. 

More specifically, these requirements may be summarized as 
follows ; 

(i) The variance inflation factor of the estimates of the 
regression coefficients should be at least one but certainly not 
as large as ten, (Ref. 6, p. 609ff.; Ref. 7). 

(ii) The ridge trace should be stable. This includes the 
accomplishment of all reversals in sign with respect to the 
initial signs at k = 0 (ordinary least squares regression) , (Ref. 4) 

(iii) The mean square error (MSB) of forecast should not have 
increased greatly beyond the initial values at k = 0, (Ref. 4, 7). 

These three requirements are the author's set of guidelines 
formed from the materials in the references. The first deals with 
the variance inflation factors which are defined by Marquardt, as 
as the diagonal elements of (see Ref. 6 p. 609) 

[X'X+kl]"^ (X'X) [X'X+kl]"^ 

when X'X is in correlation form (i.e., correlation matrix of the 
regressive variables). For our set of problems, criterion (i) is 
met uniformly for k > .02. 

The ridge trace is the vector of regressive coefficients 
viewed as functions of k, the ridge constant (Ref. 3). Typically, 
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they are unstable and change sharply as k moves away from zero, but 

settle down after that. It is recommended that k be large enough 

so that any regression coefficient that is going to change its sign 

from that at k = 0, be allowed to do it. Much of the information 

2 

in the ridge trace is summarized by L (k) , the squared length of 
the regression vector of coefficients. The ridge trace is also 
valuable if one is selecting variables for deletion. 

For our set of problems, many of the ridge traces settle down 
quite quickly by the time k reaches .04. There are some strag- 
glers however, but even so all have stabalized by the time k has 
reached .2. 

Accordingly, the range .05 ^ k .2 was chosen for further 

study. Typically the MSB grows modestly in this range. The movement 

2 2 

of the MSB is represented by the movement of R , where 1 - R 
is the ratio of the sum of squared errors to the sum of squares of 
the dependent variable. This corresponds to looking at the degrada- 
tion in the square of the multiple correlation coefficient, the two 
coinciding when k = 0. 

Table 1 contains initial information for the application of 
ridge regression using the chosen variables. On the left are the 
three kinds of variables W (contract losses) , G (gains) , and A 
(attrition). For each then are 7 LOS groups and 7 PG groups. Ridge 
regressions were performed for k starting at zero, advancing in 
increments of .005 until 0.1 is achieved and then 0.2, 0.4, 0.6, 0.8, 
and 1.0. 
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The ranges of values of k that brought the VIF (variance 
inflation factor) into the range of one to ten are tabulated next. 
Values of k this small are expected when all variables have been 
standardized. (Ref. 6) 

2 

The column headed R (0) is the square of the multiple 
correlation coefficient under ordinary least squares (k = 0) . 

It is one minus the ratio of the sum of squared residuals to the sum 
of squares of the dependent variable. It's use as a measure of the 
percent of variance accounted for is tenuous in the current appli- 
cation because these are so few degrees of freedom to estimate the 
variance of residuals. (Ref. 1, 5). Thus, large values are en- 
couraging but not to be depended upon. They measure the level of 
"explanability" for this particular set of data, but the measure 
is not reliable for prediction. It can be used to measure the 
change in the sum of squares of residuals as k increases. 

The remaining data in the table give indications of the degree 
of "ill conditioning" of the problem. The "min eigenvalue" refers 
to the correlation matrix X'X of the regressor variables. For 

orthogonal data all eigenvalues are unity. The small values indicate 

2 

substantial ill conditioning. (Ref. 3, 7). The quantity L (k) is 

the squared length of the regression coefficient vector. It will 

2 

be greatest when k = 0. The L range values correspond to values 
of k in the preceding k range. These values have stabilized in 
all cases. The asterisks (*) denote those groups whose regression 
coefficients have not stabilized in sign within the k range. 

Some basic data for our 42 cases are contained in Table 2. 
Following the designator columns are the means, standard deviations. 
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and coefficients of variation (ratio of standard deviation to mean) 
of the dependent variables. 

For illustrative purposes (and for fun) it was decided to 
display a complete set of regression predictions for 1976. The value 

k = .05 was chosen arbitrarily and applied uniformly. The values 

2 2 
R (.05) should be compared with the R (0) values of the earlier 

table to indicate the growth of the sum of squared residuals as k 

increases. The last two columns contain the 1976 forecasts and their 

root mean square errors (sum of squared residuals over n - p - 1 

raised to one-half power) . Because of the correction for degrees 

of freedom the RMSE is actually measurably larger than the dependent 

variable standard deviation in five of the cases. (W Los 1, W Los 5, 

G Los 2, A Los 2, A Los 6-17.) In thirteen of the cases it is 

dramatically smaller--this is especially notable because all of the 

five figure standard deviations are converted to four figure RMSE 

values, and several four figure standard deviations are reduced 

to three figures. The remaining cases are in the range of no change 

to modest improvement. 

Finally the corresponding regression coefficients (converted 
back to the original dimensions) appear in Table 3. These values need 
interpreting, i.e., why should contract losses be negatively correlated 
with separations in some cases and positively in others, etc. These 
questions may have rational answers, or they may indicate the need 
for a better selection of variables. 

Discussion 

The current exploratory work should be continued — seeking 
better sets of regression variables (not necessarily uniform in 



49 



kind across the cases) , individualized values of the ridge constant 



(k) , and perhaps some refined modeling. Also 
important exogeneous variables (e.g., dates of 
planning targets for the size of the Navy, the 
but they may be hard to identify. 



there may exist some 
major policy changes, 
unemployment rate) , 
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Dep 

Var 



Group 


k range 
(based on VIF) 


o 

CM 


Min. 

Eigenvalue 


( 0 ) 


L^ range 




LOS 1 


. 015 , 


.05 


.41 


.006 


24.6 


. 55 , 


2.47 




2 


. 02 , 


.03 


.94 


.002 


55.7 


. 74 , 


.93 


* 


3 


. 015 , 


.03 


.98 


.005 


4.6 


. 93 , 


1.11 




4 


. 015 , 


.03 


.99 


.012 


1.0 


. 90 , 


.93 




5 


. 015 , 


.03 


.36 


.005 


32.3 


. 94 , 


2.3 


* 


6-17 


0 . 


.1 


.91 


.104 


1.1 


. 72 , 


1.12 




18-31 


.005 


.1 


.55 


.058 


.9 


. 53 , 


.89 


* 


PGl -3 


. 01 , 


.03 


.84 


.002 


26.8 


. 57 , 


1.10 


* 


4 


. 015 , 


.1 


.96 


.021 


1.1 


. 73 , 


1.03 




5 


. 015 , 


.05 


.74 


.010 


.8 


. 70 , 


.77 




6 


0 . 


.05 


.65 


.087 


1.6 


. 85 , 


1.63 


* 


7 


0 . 


.05 


.81 


.060 


.5 


. 42 , 


.49 




8 


0 . 


.05 


.45 


.101 


.8 


. 65 , 


.78 




9 


0 . 


.1 


.95 


.259 


1.6 


1 . 10 , 


1.64 




LOS 1 


. 01 , 


.1 


.99 


.035 


4.3 


. 76 , 


2.86 


* 


2 


. 015 , 


.1 


.85 


.028 


4.7 


1 . 01 , 


3.47 




3 


0 . 


.05 


.62 


.096 


1.2 


. 57 , 


.75 




4 


0 . 


.05 


.79 


.138 


6.3 


. 68 , 


1.57 




5 


0 . 


.1 


.73 


.062 


4.6 


1 . 02 , 


4.62 




6-17 


0 . 


.1 


.68 


.119 


1.2 


. 89 , 


1.19 




18-31 


0 . 


.1 


.97 


.059 


4.0 


1 . 11 , 


4.03 




PGl -3 


. 01 , 


.1 


.99 


.030 


4.2 


. 78 , 


3.47 


* 


4 


. 005 , 


.1 


.41 


.041 


.5 


. 34 , 


.46 




5 


0 . 


.1 


.62 


.062 


1.4 


. 90 , 


1.41 




6 


0 . 


.1 


.73 


.041 


.8 


. 56 , 


.78 




7 


0 . 


.1 


.69 


.120 


1.2 


. 90 , 


1.16 




8 


0 . 


.05 


.67 


.210 


1.3 


. 96 , 


1.26 




9 


0 . 


.1 


.71 


.265 


1.8 


. 49 , 


1.83 


* 


LOS 1 


0 . 


.1 


.96 


.311 


1.4 


. 94 , 


1.37 




2 


0 . 


.05 


.24 


.126 


.3 


. 25 , 


.33 


* 


3 


0 . 


.02 


.69 


.253 


1.0 


. 86 , 


.95 




4 


0 . 


.1 


.78 


.295 


.5 


. 44 , 


.51 




5 


0 . 


.1 


.59 


.109 


1.2 


. 66 , 


1.18 




6-17 


0 . 


.1 


.42 


.164 


.6 


. 35 , 


.57 




18-31 


0 . 


.05 


.33 


.094 


2.7 


1 . 22 , 


2.72 




PGl -3 


.01 , 


.1 


.93 


.039 


.8 


. 60 , 


.78 




4 


0 . 


.1 


.94 


.110 


.8 


. 53 , 


.79 




5 


0 . 


.1 


.85 


.185 


1.1 


. 70 , 


1.13 


* 


6 


0 . 


.02 


.59 


.119 


.9 


. 75 , 


.90 




7 


0 . 


.1 


.56 


.251 


1.1 


. 62 , 


1.06 




8 


0 . 


.05 


.65 


.425 


.8 


. 68 , 


.78 




9 


0 . 


.1 


.55 


.277 


.9 


. 62 , 


.92 
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RMSE 



Dep 

Var 

W 



G 



A 



Group 


Mean 


Std 

Dev 


LOS 1 


889.9 


954.6 


2 


10545.6 


6410.5 


3 


21957.2 


11497.9 


4 


47718.0 


16041.8 


5 


8832.9 


2833.4 


6-17 


9853.2 


2614.7 


18-31 


875.3 


283.3 


PGl-3 


27683.2 


8296.4 


4 


44525.9 


12773.1 


5 


24146.5 


10168.7 


6 


3298.4 


747.3 


7 


837.1 


316.2 


8 


121.7 


32.4 


9 


59.2 


18.7 


LOS 1 


3482.3 


2476.2 


2 


1978.9 


1185.5 


3 


1401.3 


508.8 


4 


1508.0 


483.4 


5 


1108.8 


207.9 


6-17 


5518.1 


1242.5 


18-31 


935.6 


323.5 


PGl-3 


8531.9 


4707.1 


4 


2675.9 


650.4 


5 


2508.0 


379.4 


6 


1233.8 


302.4 


7 


802.0 


321.8 


8 


117.4 


38.4 


9 


64.0 


21.7 


LOS 1 


20545.1 


7329.1 


2 


10131.1 


3000.8 


3 


5972.9 


1499.3 


4 


3001.3 


961.3 


5 


1095.2 


251.7 


6-17 


5767.1 


1617.6 


18-31 


876.1 


105.5 


PGl-3 


35783.0 


8841.4 


4 


5345.8 


1839.6 


5 


3277.1 


1171.3 


6 


1851.4 


664.6 


7 


878.6 


377.2 


8 


189.0 


71.8 


9 


64.0 


13.6 



Coef 

Var 


R^(.05) 


Forecast 

1976(k=.0! 


1.07 


.291 


569.8 


.61 


.847 


6051.1 


.52 


.959 


3131.8 


.34 


.988 


21117.8 


.32 


.224 


6589.1 


.26 


.901 


3712.5 


.32 


.547 


756.0 


.30 


.791 


18604.2 


.29 


.954 


28005.3 


.42 


.742 


10955.0 


.23 


.634 


2463.3 


.38 


.810 


249.5 


.27 


.451 


77.7 


.32 


.948 


13.1 


.75 


.945 


7800.8 


. 60 


.408 


5003.2 


.36 


.614 


2704.7 


.32 


.734 


1521.9 


.19 


.685 


1049.2 


.23 


.666 


3412.0 


.36 


. 693 


929.2 


.55 


.951 


19424.3 


.24 


.791 


2803.4 


.15 


.445 


2229.4 


.25 


.669 


825.1 


.40 


.801 


756.7 


.33 


.844 


80.0 


.34 


.623 


52.1 


.36 


.957 


31200.8 


.30 


.243 


9611.2 


.25 


.686 


6606.8 


.32 


.779 


1785.7 


.24 


.581 


721.4 


.28 


.419 


7608.1 


.12 


.297 


764.0 


.25 


.924 


50379.2 


.34 


.932 


3924.1 


.36 


.846 


2583.2 


.36 


.587 


1074.7 


.43 


.557 


881.9 


.38 


.653 


227.7 


.21 


.542 


72.6 



TABLE 2 



1137.1 

3543.6 

3298.6 

2474.9 

3529.7 

1163.4 

269.7 

5362.6 

3895.3 

7301.7 

639.3 

194.7 

33.9 

6.0 

1010.6 

1490.0 

516.1 

406.9 
190.5 

1173.4 

291.2 

1708.9 

485.4 

461.2 

284.1 

234.5 

24.8 

21.8 

2138.9 

3691.4 

1187.5 

638.5 

230.5 

1743.5 

125.0 

3447.0 

678.0 

649.2 

604.1 
355.0 

59.9 

13.1 
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Regression Coefficients of the 
Indicated Variables (k=.05) 



Dep 

Var Group 



W LOS 1 
2 

3 

4 

5 

6-17 

18-31 

W PGl-3 

4 

5 

6 

7 

8 
9 



G LOS 1 
2 

3 

4 

5 

6-17 

18-31 

G PGl-3 

4 

5 

6 

7 

8 
9 



A LOS 1 
2 

3 

4 

5 

6-17 

18-31 

A PGl-3 

4 

5 

6 

7 

8 
9 



Const. 

“854. 697 
“3323. 404 
“3583. 668 
"15586. 46 
“169.258 
“4943. 373 
1437 . 579 

11797. 36 
“2303.351 
’45096.391 
4465 . 6 
“934. 238 
3.105 
59.102 

Const. 

“4275 . 732 
693.392 
1891.658 
1765.656 
428.317 
10478. 53 
“341 . 382 

6663.93 
6354.993 
_2478. 81 
1655. 742 
867.984 
“227. 022 
“68.438 

Const . 

32096. 51 
“1762.357 
“4252.39 
“5700.411 
2597.368 
1115.538 
26 . 042 

"2625 . 808 
6853. 783 
5675 . 728 
867.841 
423.653 
7794 . 1 12 
429 . 847 



U 

0. 33028 8 
“O. 25541 
“2 .480848 
0 . 748573 
“2 .282954 
11.169272 
0. 007831 

“0.193937 
14.6953 
12 . 058667 
“5. 851896 
“0.899586 
1 . 013038 
0. 842668 

G 

0 . 039999 
0 . 895267 
0.779972 
0. 322559 
0. 638435 
“0.667479 
“0.055081 

0.453469 
"0.404924 
"0.231998 
"0.207115 
"0.209065 
"O . 167405 
’0.643818 

A 

0. 754409 
0.439853 
0. 460742 
0.1195 
’0.246084 
’O. 38336 
0.245201 

0.916001 
0.298235 
0. 207994 
0. 300298 
0. 674542 
0.151657 
0. 196835 



V 

0. 693534 
0 . 952801 
“O. 12472 
0. 148314 
0.467961 
0. 130415 
“O. 058284 

0.568101 
“0.086057 
0. 253253 
0.4956 
0. 371512 
0.411105 
1.052868 

A 

0.393833 
0.055717 
"0.150336 
"O . 304602 
"0.419748 
"0.489492 
1.402199 

0.289034 
'0.285919 
'0.156558 
'O . 228523 
0.418086 
0.176314 
0. 901776 

U 

’0.401486 
2 . 2 0232 1 
3.664622 
3.283925 
0 . 234793 
0 . 2 32208 
3.8736 

2 .507623 
0 . 002521 
0.497185 
0.000693 
0.619689 
6. 408782 
0.779111 



S 

“O . 353212 
0 . 04428 
0. 078764 
“O . 014002 
“0 . 168073 
0 . 021375 
“O. 101243 

0.229846 
“O . 109196 
0.07591 
“O . 157238 
“O . 06209 
“O . 034354 
“O . 048284 

U 

“2.790196 
0. 00732 
0 . 074671 
“O . 0 16921 
“0.282194 
“2 . 190218 
0. 8602 08 

“0.243511 
0.633956 
“O . 258073 
1 .245263 
“1.634225 
0. 320814 
3. 739978 

X 

“0.826655 
“O. 162965 
“O. 013841 
0. 046024 
“0.159988 
“O. 054455 
0. 026573 

29 . 055833 
“l . 592162 
“O . 5 1989 
“O. 118133 
“0.005627 
“0.172355 
“0.021464 



T 

0. 017917 
0. 065939 
0.315323 
0.793914 
0. 318064 
0. 109561 
0.006547 

“O. 004464 
0. 529989 
0.590159 
0.002881 
0. 053288 
0.016494 
“0.007198 

S 

0.462755 
"O. 089016 
"0.022291 
0. 001838 
0. 011759 
0. 140414 
"O . 094002 

"0.152454 
'O. 004342 
0. 001918 
'0.003933 
'0.099786 
'0 . 049615 
'O. 038501 

T 

'O. 069612 
0. 037994 
0. 05637 
0. 085348 
‘O. 006043 
’O. 085045 
'O . 001588 

0.104343 
0. 022486 
0.024126 
0. 027116 
0. 007372 
0 . 00384 
0.01474 



TABLE 3 



T 

0.006997 
0.002754 
0. 002087 
0.00108 
0. 0172 18 
0 . 019836 
0. 016774 

0. 019333 
0. 010881 
0. 010872 
0. 046304 
0. 018155 
0. 04825 
0.039221 
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APPENDIX B 



ELEVEN YEAR AVERAGES 



TABLE B.l 

Average Separations 



El-3 


E4 


E5 


E6 


E7 


E8 


2057 


1254 


119 


1 5 


11 


2 


9720 


7688 


969 


41 


12 


2 


7183 


11837 


4210 


43 


18 


3 


8535 


241 07 


1 7792 


146 


25 


4 


740 


201 4 


2512 


159 


32 


4 


404 


917 


2069 


560 


.40 


4 


111 


349 


119 7 


915 


48 


6 


79 


296 


1282 


1433 


80 


7 


42 


249 


1278 


188 0 


197 


6 


49 


229 


1 078 


1712 


242 


6 


18 


97 


488 


1066 


278 


1 0 


14 


87 


44 3 


11 36 


398 


25 


7 


4 5 


359 


1299 


6 75 


66 


8 


51 


384 


1434 


844 


103 


7 


48 


374 


1548 


1145 


182 


6 


40 


279 


1175 


1003 


188 


5 


24 


140 


7 09 


817 


213 


3 


15 


108 


521 


681 


188 


2 


11 


61 


280 


543 


185 


2 


8 


3 6 


189 


426 


162 


1 


6 


19 


95 


280 


128 


1 


4 


14 


64 


192 


9 4 


1 


3 


11 


45 


148 


81 


1 


2 


9 


33 


128 


77 


1 


2 


5 


28 


1 02 


66 


1 


2 


5 


15 


76 


54 


1 


1 


2 


9 


57 


42 


1 


1 


2 


6 


36 


30 


1 


1 


1 


3 


1 7 


9 


1 


1 


1 


3 


1 0 


7 


1 


1 


2 


2 


11 


7 



E9 

2 

1 

2 

2 

2 

3 

3 

4 

4 

4 

4 

4 

6 

9 

25 

32 

52 

55 

68 

68 

65 

54 

60 

57 

59 

48 

47 

29 

11 

9 

6 



55 



L-3 

307 

104 

038 

666 

579 

3 51 

91 

62 

33 

39 

14 

11 

5 

5 

6 

5 

4 

2 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



E9 

1 

0 

1 

1 

1 

2 

2 

3 

3 

3 

3 

3 

5 

8 

23 

31 

51 

54 

67 

67 

64 

53 

59 

55 

58 

47 

46 

28 

10 

8 

5 



TABLE B.2 



AVERAGE ELIGIBLES 



E4 


E5 


E6 


E7 


E8 


12 3 4 


116 


14 


9 


1 


7573 


960 


40 


10 


1 


11693 


4187 


41 


16 


1 


2 3 7 7 0 


17690 


144 


24 


3 


19 82 


2488 


157 


31 


3 


898 


2050 


556 


38 


3 


339 


11 89 


911 


47 


5 


288 


12 71 


1428 


79 


6 


240 


1266 


1873 


196 


5 


221 


10 6 5 


1706 


241 


5 


9 4 


483 


1061 


276 


9 


84 


440 


1132 


39 7 


24 


43 


35 5 


1296 


673 


65 


49 


380 


1429 


842 


102 


45 


37 0 


1544 


1143 


181 


38 


275 


1172 


10 01 


187 


22 


138 


707 


816 


212 


14 


107 


519 


679 


187 


10 


5 9 


278 


541 


184 


7 


3 5 


187 


423 


161 


5 


18 


93 


278 


127 


3 


13 


61 


189 


93 


2 


1 0 


42 


144 


79 


1 


8 


30 


124 


76 


1 


4 


24 


9 7 


65 


1 


3 


12 


71 


52 


0 


1 


8 


53 


40 


0 


1 




3 2 


28 


0 


0 


2 


14 


8 


0 


0 


2 


8 


6 


0 


1 


1 


9 


6 



56 



1-3 

19 0 

841 

63 9 

260 

432 

127 

56 

43 

19 

24 

6 

6 

3 

4 

3 

2 

2 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 



TABLE B.3 



AVERAGE NON-REENLISTMENTS 



E4 


E5 


E6 


E7 


E8 


E9 


1134 


102 


11 


9 


2 


1 


6343 


647 


30 


10 


2 


1 


9600 


2 59 4 


30 


16 


2 


2 


21297 


13434 


52 


22 


4 


2 


1536 


1711 


79 


28 


4 


2 


525 


1361 


354 


34 


3 


3 


190 


566 


4 04 


37 


6 


3 


154 


511 


460 


37 


6 


4 


91 


396 


428 


38 


5 


4 


83 


347 


39 7 


5 0 


5 


3 


27 


107 


1 82 


55 


5 


3 


21 


82 


142 


52 


7 


4 


11 


55 


1 04 


47 


5 


3 


13 


45 


99 


46 


7 


3 


9 


35 


78 


40 


6 


3 


6 


21 


53 


3 5 


7 


2 


3 


11 


24 


23 


5 


2 


2 


7 


19 


21 


6 


3 


2 


4 


10 


19 


4 


3 


2 


5 


9 


19 


5 


3 


2 


3 


6 


1 3 


3 


2 


1 


2 


5 


11 


3 


2 


1 


2 


4 


14 


3 


1 


1 


2 


5 


16 


3 


3 


1 


1 


4 


13 


3 


3 


1 


1 


3 


11 


3 


2 


1 


1 


2 


9 


3 


2 


1 


1 


1 


6 


2 


2 


1 


1 


1 


4 


2 


1 


1 


1 


1 


3 


1 


1 


1 


1 


1 


7 


3 


1 
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APPENDIX C 



APL PROGRAMS 



The main programs are REGR and RESID which perform the 
ridge regression computations and develops the residuals and 
their properties (resp.). These are used in FCAST which compute 
the regression forecast for the entire 31 by 7 set. Individual 
cell forecasts are computed by PRED. The other programs prepare 
the data. 

More explicitly, the raw data consist of five 11 by 31 
by 9 arrays Dl, D2 , ... , D5 which carry the eleven year 

values of non-reenlistments, separations, eligibles, retentions, 
inventory, (resp.). The program COMPRESS merely telescopes 
pay grades El, E2 , E3 together producing 11 x 31 x 7 arrays. 

This must be done separately. 

The function FCAST requires an explicit input vector 
CR which is the set of partition boundaries for grouping the 
range space of the 217 eleven year means of the object variable. 
It also uses the data array DD implicitly. 

VCOMPRESSiUlV 

V Z^COMPRESr, D\V\Z\ 

Cl] P^^(p7))p0 

[2] n;; 1 2 3]^1 

[3] Zl^-\-/V^D 

[4] Z-*- 0 0 3 

[ 5 ] Z^Zl.Z 

V 
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VFCASTiUlV 

V PP-i-FCAST CR',J\P 
Cl] l'^((pf7Z?),3.1. ,7)p0 

[2] PP^ 31 7 pO 

[3] 

[4] 7,l:J^,r+i. 

[5] VLJ ; )-DD>CRlJ'i 

[ 6 ] P^PRET) VUx ;] 

[7] PP^PP+P 

[8] -^T.lx xJ<~l + pCR 

V 



VPi? /)[[]] V 

V POPPED I/;PP” 

[1] R^PREP V 

[2] R^RC REGR D 

[3] E^D RESID R 

[4] 31 7 pPPP-H( ,l/)\pp 

V 



The function FCAST creates a 31 by 7 screening matrix 
of zeros and ones which is used by the data shaping functions 
PREP, BUILD, and SHAPE. The latter, SHAPE, merely shaves off 
the first or last face (according to whether the variable Q 
is minus one or plus one) of the individual data arrays. The 
function BUILD assembles the object variable in its first face 
and the p regression variables in the remaining faces. This 
is the only function that needs to be changed with each appli- 
cation. The version shown is for regressing non-reenlistments . 
on inventory, previous non-reenlistments, and previous reten- 
tions. The output of PREP prepares the assembled data for 
REGR. 



59 



VPRKPiniV 

V D^PRF.P V 

[1] REBUILD V 

[2] (pZ))[ll ; ;l + i (pZ) )[3]-l] 

C3] (10x(pZ?)[2]), (pZ?)C3l )p/? 

V 



v/?z/j/i/)rniv 

V REBUILD V \S1 \S2 \T \F. %0 

C 1 ] (3-^1 

[2] .91 BHAPF. RRl 

13] E*-V SHAPE RRH 

[4] 

[5] S?.*'V SHAPE RRl 

[6] T<-V SHAPE RR5 

[7] R^S?. ^[.2 . S3 SI 

[8] R*-(R,E),T 

V 



V5/Z/lPZ^’Cn] V 

V S^V SHAPE RiNiK 

C 1 ] N-*-l IpR 

[2] K^+/+/V 

[3] V*-iN,pV)pV 

[4] R^(N,K)p(,V)/ ,R 

[5] S^i(QxN-l) ,K)1R 

V 



The inputs to REGR are the data matrix D and the 
vector of ridge constants RC. The rank of D is three; 
the first dimension (faces) indexing the problems (separate 
predictions) , the second (rows) the observations (years) , 
and the third (columns) the variables--the object variable 
coming first followed by the regression variations. The output R 
also has three dimensions. Again the faces are the prediction 
problems. For any face, the first pRC element of column one 
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v/?^;r7i?[n] V 

V R^RC REGR D \MU \3TG \X \Y iC %B ',R2 ',A \FAC \RT \N %P \ID \K \CT 

[ 1 ] RG*-,RG 

[2] -^-(2+l26 )x i3 = pp75 

[3] /)^(l.pP)p,0 

[ 4 ] P^(p/?)[3]-l 

[5] I7)^(P,P)pl .PpO 

[6] ;V^(pP)[2] 

[7] P^((ltpP),(pPr7),(P+l))pO 

[ 8 ] WW-^+/[ 2 ] DiR 

[9] A^V- -2 1 . 3 X)(N ,pMU)pMU 

[ 10 ] SIG^ + /L2:UA*2)iN-l 

[11] SIG*-GIG*0 .5 

[12] D^Ai 2 13 X?(N ,p5IG)pSTG 

[13] A^O 

[14] ft MV, GIG ARE MATRIGS OF MEAVS AND STD DEVS, 

[15] ft TEE DATA D HAS BEEN STANDARDIZED . 

[16] f\ A ms A TEMPORARY VALUE, 

[17] ft BEGIN OUTER LOO^ ON FACES, 

[18] L^iFAC^n 

[19] OUTER :FAC->-FAG+l 

[2 0] 0 1 4-P[Plf7;;] 

[21] Y^DLFAC \;ll 

[22] ^P-1 )x((s?A')+.x,r 

[23] ft BEGIN INNER LOOP ON K (RIDGE TRIAL,) 

[24] RT^O 

[25] INNER xRT^R T+ 1 

[ 26 ] K^RGlRTl 

[27] A^W.G+K^(P,P)pl,PpO 

[28] Bm+ ,^(<SfX)+ ,y<YiN-l 

[29] R2.^1-(iN -1)^+/ (Y-X+,^B)*2 

[30] RLFAG iRTil^F2 ,B 

[31] ■*INNER^\RT<pRC 

[32] ■*OUTER^\FAC<l\pD 

[33] PI ALL RIDGE REGRESSIONS COMPLETE, 

[34] ft FINISH OUTPUT ARRAY 

[3 5 ] P-«-P,[2] //[/,[ 1 . 5 ] SIG 

[36] P^P,( l+pP)pP^, 0 0 

V 



are the coefficients of determination (i.e. square of mult, 
correl.) and the first pRC elements of the last column are 
the values of the ridge constant. All intermediate columns 
contain the standardized regression coefficients in the same 
order as they appear in D. The last two rows contain the 
mean and standard deviation (resp.) of the object and re- 
gression variables. Zeros are used to square up the array. 
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The function RESID takes as input the data matrix D 
and the output R of REGR. Its explicit output E is the 
set of residuals. Implicit output includes EB, the mean 
residual; SEE, the root mean square error of forecast; and 
BB , the regression coefficients converted back to dimensional 
form. The array C is the constant term of the regression 
equations. Line 15 shows that only the positive parts of the 
predictions are used. 



VRESIDLUl^ 

V E*-P REEin R iM iS iB iMl '.FBI iSS iSl ;D1 \D2 ;PR1 ;C iCC iKK 

[1] RC^,_RC 

[2] P^( l + pi?)-2 

[31 M^RL il + pRC i\P + ll 

[4] 5-f-PC ;2+pPf7; iP+1 ] 

[5] B^RLiK’,l + \Pl 

[6] 5l^b(P,W^l + pP)p5[ ;;1] 

[7] BB-^B^SliiS-^iN ,P+1 )pS)[ ;l + iP] 

[ 8 ] 

[9] -+/W[ ; ;l + iP]x(i7,l ,P)pPP 

[10] Pl-f- 0 0 1 

[11] D9.^ 0 0 "2 4-P 

[12] 2 1 3 b( (/fX^(pP)[2] ) ,P,P)p ,BB 

[13] CC*-t?{KK,N)pC 

[14] PBl■<-0^^7^?++/BxBl 

[15] PB^OFd , (pX)[l] )pC+ + /X^{pX)pBB 

[16] E-<-{D2-<-(.N ,KK)pD2)-PRl 

[17] SEE^SL ;l]x( (1-B[ ;1] )x (KKiKK-P) ) * 0 .S 

[18] EB*-{iKK)^ + /E 

[19] BE-^-i(KK-P)T + / (E-t?(KK,N)pEB)*2 
[2 0 ] 

[21] BB*- 11 6 BET BB 

[22] C*- 11 3 DFT C 

[23] BB*-C,BB, 11 3 DFT SE*-((llpR ) ,l)pSE 

V 



62 



U179560 



library - RESEARCH REPORTS 
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